Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
10,637 sentences
Production Status:
Newly created-in progress
Use:
Document Classification, Text categorisation
- Paper title: Don’t Patronize Me! An Annotated Dataset with Patronizing and Condescending Language towards Vulnerable Communities
- Paper track: Long paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Carla Perez Almendros | Don't Patronize Me! Dataset | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Bilingual
Languages:
English German
Availability:
It will be freely available.
License:
Size:
259379 sentences
Production Status:
Newly created-finished
Use:
Evaluation/Validation
- Paper title: ContraCAT: Contrastive Coreference Analytical Templates for Machine Translation
- Paper track: Long paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Dario Stojanovski | ContraPro Adversarial | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Bilingual
Languages:
English German
Availability:
It will be freely available.
License:
Size:
38394 sentences
Production Status:
Newly created-finished
Use:
Evaluation/Validation
- Paper title: ContraCAT: Contrastive Coreference Analytical Templates for Machine Translation
- Paper track: Long paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Dario Stojanovski | ContraCAT | /N |
Documentation:
None
Written
Lexicon,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
BSD-like
Size:
117000 synsets
Production Status:
Existing-used
Use:
Corpus Creation/Annotation
- Paper title: ContraCAT: Contrastive Coreference Analytical Templates for Machine Translation
- Paper track: Long paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Dario Stojanovski | Wordnet | /N |
Documentation:
None
Written
Evaluation Data,
Language Type:
Bilingual
Languages:
English German
Availability:
Freely Available
License:
MIT License
Size:
12000 sentences
Production Status:
Existing-used
Use:
Evaluation/Validation
- Paper title: ContraCAT: Contrastive Coreference Analytical Templates for Machine Translation
- Paper track: Long paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Dario Stojanovski | ContraPro | /N |
Documentation:
None
Written
Tagger/Parser,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
None
Production Status:
Existing-used
Use:
Corpus Creation/Annotation
- Paper title: ContraCAT: Contrastive Coreference Analytical Templates for Machine Translation
- Paper track: Long paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Dario Stojanovski | Spacy Dependency Parser | /N |
Documentation:
None
Written
Software Toolkit,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
GNU General Public License v3
Size:
None
Production Status:
Existing-used
Use:
Corpus Creation/Annotation
- Paper title: ContraCAT: Contrastive Coreference Analytical Templates for Machine Translation
- Paper track: Long paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Dario Stojanovski | Stanford Neural Coreference Resolution System | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English German
Availability:
Freely Available
License:
Size:
22500000 sentences
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: ContraCAT: Contrastive Coreference Analytical Templates for Machine Translation
- Paper track: Long paper
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Dario Stojanovski | OpenSubtitles | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
234k tokens
Production Status:
Existing-used
Use:
Knowledge Discovery/Representation
- Paper title: Story Generation with Rich Details
- Paper track: Short paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Fangzhou Zhai | InScript | /N |
Documentation:
yes
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Scraped from web; scripts and unofficial scrapes available
License:
Size:
2500000000 words
Production Status:
Existing-used
Use:
Evaluation/Validation
- Paper title: Similarity or deeper understanding? Analyzing the TED-Q dataset of evoked questions
- Paper track: Short paper
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Matthijs Westera | Bookcorpus | /N |
Documentation:
https://yknzhu.wixsite.com/mbweb




